Speaker recognition with cough, laugh and "Wei"

Authors

Miao Zhang

Yixiang Chen

Lantian Li

Dong Wang

Abstract

This paper proposes a speaker recognition (SRE) task based on trivial speech events, such as cough and laugh. These trivial events are ubiquitous in conversations and less subject to intentional change, and therefore offer valuable cues for identifying the genuine speaker in disguised speech. However, trivial events are often short and idiosyncratic in their spectral patterns, making SRE extremely difficult. Fortunately, we found a very powerful deep feature learning structure that can extract highly speaker-sensitive features. Using this tool, we studied SRE performance on three types of trivial events: cough, laugh and “Wei” (a short Chinese “Hello”). The results show that there is rich speaker information within these trivial events, even for cough, which is intuitively less speaker-distinguishable. With the deep feature approach, the equal error rate (EER) reaches 10%-14% on the three trivial events, despite their extremely short durations (0.2-1.0 seconds).
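The headline result above is stated as an equal error rate. As a reminder of what that metric measures, here is a minimal sketch of an EER computation over verification trial scores. It is not the authors' evaluation code: the function name `compute_eer` and the toy scores are hypothetical, and the threshold sweep is a simple approximation that assumes higher scores mean "same speaker".

```python
import numpy as np


def compute_eer(target_scores, nontarget_scores):
    """Approximate the equal error rate from two sets of trial scores.

    target_scores: scores of same-speaker trials (hypothetical input).
    nontarget_scores: scores of different-speaker trials (hypothetical input).
    """
    target = np.asarray(target_scores, dtype=float)
    nontarget = np.asarray(nontarget_scores, dtype=float)

    # Use every observed score as a candidate decision threshold.
    thresholds = np.unique(np.concatenate([target, nontarget]))

    # False rejection rate: fraction of target trials scored below the threshold.
    frr = np.array([(target < t).mean() for t in thresholds])
    # False acceptance rate: fraction of non-target trials at or above the threshold.
    far = np.array([(nontarget >= t).mean() for t in thresholds])

    # The EER is where the two error rates cross; take the closest point.
    idx = np.argmin(np.abs(far - frr))
    return float((far[idx] + frr[idx]) / 2.0)


# Toy scores: one target trial scores below one non-target trial.
toy_target = [0.9, 0.8, 0.4, 0.7]
toy_nontarget = [0.1, 0.2, 0.5, 0.3]
toy_eer = compute_eer(toy_target, toy_nontarget)
print(f"EER = {toy_eer:.2%}")
```

With short events such as cough or laugh, each trial yields one score from a fraction of a second of audio, so an EER of 10%-14% means that at the balanced threshold roughly one trial in seven to ten is misclassified in either direction.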

Similar resources

Improved I-vector-based Speaker Recognition for Utterances with Speaker Generated Non-speech sounds

Conversational speech not only contains several variants of neutral speech but is also prominently interlaced with several speaker generated non-speech sounds such as laughter and breath. A robust speaker recognition system should be capable of recognizing a speaker irrespective of these variations in his speech. An understanding of whether the speaker-specific information represented by these ...

Human and Machine Speaker Recognition Based on Short Trivial Events

Human speech often has events that we will call trivial events, e.g., cough, laugh and sniff. Compared to regular speech, these trivial events are usually short and variable, thus generally regarded as not speaker discriminative and so are largely ignored by present speaker recognition research. However, these trivial events are highly valuable in some particular circumstances such as forensic ...

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

The cough/laugh syndrome: MR evaluation.

The cough/laugh syndrome is characterized by the onset of severe headache immediately after an episode of coughing, laughing, or straining. It has been associated with pathologic changes in the posterior fossa, in particular, cerebellar tonsillar herniation or ectopia [1-3]. MR imaging is important in the evaluation of craniocervical anomalies, particularly Chiari malformations [4, 5]. We prese...

Publication date: 2017
